Deployment-Efficient Reinforcement Learning Via Model-Based Offline Optimization